Picture for Tong Xiao

Tong Xiao

Jack

CoMeT: Collaborative Memory Transformer for Efficient Long Context Modeling

Add code
Feb 02, 2026
Viaarxiv icon

APR: Penalizing Structural Redundancy in Large Reasoning Models via Anchor-based Process Rewards

Add code
Jan 31, 2026
Viaarxiv icon

SpanNorm: Reconciling Training Stability and Performance in Deep Transformers

Add code
Jan 30, 2026
Viaarxiv icon

Causal Autoregressive Diffusion Language Model

Add code
Jan 29, 2026
Viaarxiv icon

The Llama 4 Herd: Architecture, Training, Evaluation, and Deployment Notes

Add code
Jan 15, 2026
Viaarxiv icon

SERM: Self-Evolving Relevance Model with Agent-Driven Learning from Massive Query Streams

Add code
Jan 14, 2026
Viaarxiv icon

REFA: Real-time Egocentric Facial Animations for Virtual Reality

Add code
Jan 07, 2026
Viaarxiv icon

Probing Preference Representations: A Multi-Dimensional Evaluation and Analysis Method for Reward Models

Add code
Nov 16, 2025
Viaarxiv icon

Beyond English: Toward Inclusive and Scalable Multilingual Machine Translation with LLMs

Add code
Nov 10, 2025
Figure 1 for Beyond English: Toward Inclusive and Scalable Multilingual Machine Translation with LLMs
Figure 2 for Beyond English: Toward Inclusive and Scalable Multilingual Machine Translation with LLMs
Figure 3 for Beyond English: Toward Inclusive and Scalable Multilingual Machine Translation with LLMs
Figure 4 for Beyond English: Toward Inclusive and Scalable Multilingual Machine Translation with LLMs
Viaarxiv icon

TimeSense:Making Large Language Models Proficient in Time-Series Analysis

Add code
Nov 09, 2025
Viaarxiv icon